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Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) OR THIRTY (30) DAYS. 
WHICHEVER IS LONGER, FROM THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1 .136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 1 33). 
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent temi adjustment. See 37 CFR 1 .704(b). 

Status 

1 )K Responsive to communication(s) filed on 05 October 2007 . 
2a)K This action is FINAL. 2b)n This action is non-final. 

3) 0 Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under £x parte Quayle, 1935 CD. 11, 453 O.G. 213. 

Disposition of Claims 

4) S Claim(s) 1-36 is/are pending in the application. 

4a) Of the above claim(s) is/are withdrawn from consideration. 

5) n Claim(s) is/are allowed. 

6) S Claim(s) 1-4.8-14. 18-21.26 and 31-36 is/are rejected. 

7) ^ Claimfs^ 5-7. 15-17.22-25 and 27-30 is/are obiected to. 

8) n Claim(s) are subject to restriction and/or election requirement. 

Application Papers 

9) S The specification is objected to by the Examiner. 

10)0 The drawing{s) filed on is/are: a)^ accepted or b)n objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1.85(a). 

Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 
1 The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-152. 
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1 .□ Certified copies of the priority documents have been received. 

2. n Certified copies of the priority documents have been received in Application No. . 

3. n Copies of the certified copies of the priority documents have been received in this National Stage 

application from the International Bureau (PCT Rule 17.2(a)). 
* See the attached detailed Office action for a list of the certified copies not received. 
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DETAILED ACTION 

1. Applicant's amendment dated October 5, 2007, responding to the Office 
action mailed July 6, 2007 provided in the rejection of claims 1-36, wherein 
claims 1. 5, 10-12, 15, 21, 26, 31, and 34 are amended. 

Claims 1-36 remain pending in the application and which have been fully 
considered by the examiner. 

Applicant's arguments with respect to claims rejection have been fully considered 
but are moot in view of the new grounds of rejection - see Ziegler et al,, art made of 
record, as applied hereto. 

No amended specification received with the amendment dated October 5, 2007. 

Applicant's amendment necessitated the new ground(s) of rejection 
presented in this Office action. Accordingly, THIS ACTION IS MADE FINAL. See 
MPEP § 706.07(a). 

Applicant is reminded of the extension of time policy as set forth in 37 CFR 
1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until 
after the end of the THREE-MONTH shortened statutory period, then the shortened 
statutory period will expire on the date the advisory action is mailed, and any extension 
fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory 
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action. In no event, however, will the statutory period for reply expire later than SIX 
MONTHS from the date of this final action. 



Specification Objections 

2. The specification is objected to because the following informalities: 

• "As depicted in FIG. 2B, PPS 280 may be", cited in [00029], Line 2, 
should be corrected as "As depicted in FIG. 2A, PPS 280 may be", 

• "a D-stage processor pipeline of network processor 500 of FIG. 1", 
cited in [00029], Lines 4-5. should be corrected as "a D-stage 
processor pipeline of network processor 500 of FIG. 8" 

Appropriate correction is required 



Claim Rejections - 35 USC § 103(a) 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or 

described as set forth in section 102 of this title, if the differences between the subject matter 
sought to be patented and the prior art are such that the subject matter as a whole would 
have been obvious at the time the invention was made to a person having ordinary skill in the 
art to which said subject matter pertains. Patentability shall not be negatived by the manner 
in which the invention was made. 



3. Claims 1, 11, 21, 26, 31, and 34 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Lakshmanamurthy et al. {"Network Processor 
Performance Analysis Methodology", Aug, 15, 2002, Intel Technology Journal) 
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(hereinafter ^Lakshmanamurth/) in view of Ziegler et al. (Compiler-Generated 
Communication for Pipelined FPGA Applications, June 2, 2003, ACM, pp, 610- 
615) (hereinafter 'Ziegler*) 

4. As to claim 1 (Currently Amended), Lakshmanamurthy discloses a 
method comprising: 

• configuring one or more processors into a D-stage processor pipeline 
(e.g.. Sec. of "ABSTRACT". 1®* Par. - this paper describes the 
performance analysis methodology developed to analyze the perfonnance 
of various networking applications that are targeted for running on the IXP 
2400 network processor , the second-generation IXA network processor : 
P. 19, L-CoL, 3^^ Par, Lines 4-9 - this methodology involves diving the 
application into pipeline blocks ., and latency budget for each pipeline 
element, and mapping the application blocks to software paradigms and 
the hardware resources). 
Lakshmanamurthy discloses the performance analysis methodology 
developed to analyze the performance of various networking applications that are 
targeted for running on the IXP2400 network processor (e.g.. Abstract, 1®* Par.), 
but does not explicitly disclose the followings: 

• transforming a sequential network application program into D-pipeline 
stages that collectively perform the sequential packet processing state 
(PPS) of the sequential network application programs; and 
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• executing the D-pipeline stages in parallel within the D-stage processor 
pipeline to provide parallel execution of the sequential network 
application program. 

However, in an analogous art of Compiler-Generated Communication for 
Pipelined FPGA Applications, Ziegler discloses the followings: 

• transforming a sequential network application program into D-pipeline 
stages that collectively perform the sequential packet processing state 
(PPS) of the sequential network application programs (e.g., Abstract, 
Lines 1-4 - ... a set of compiler analyses and an implementation that 
automatically map a sequential and un-annotated C program into a 
pipelined implementation targeted for an FPGA Lines 5-12 - ...from 
parallelizing compilers to identify pipeline stages using the results of 
this analysis, we automatically generate application-specific pipelined 
FPGA hardware designs : Fig. 1 - MVIS Kernel Analysis; Sec. 1 
Introduction, 2"^ Par., 2"*^ Bullet - ... an implementation of the analyses 
and code transformations required to automatically design and synthesize 
pipelines tailored to seouential program characteristics : 2"*^ Par., Lines 4-7 
- ... a set of compiler analyses to derive, from a seouential algorithm 
description, the components of pipelined design ., i.e., task parallelism and 
communication reouirements : Fig. 2 - DEFACTO (Design Environment for 
Adaptive Computing Technology) Compilation Flow, steps of 
'Communication and Pipeline Analysis', *Code Transformations', 'SUIF to 
VHDL', 'Behavioral Synthesis and Estimation'; Sec. 3 Compiler Analysis, 
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2"^ Par. - The DEFACTO system-level compiler takes an un-annotated, 
sequential program and applies a set of communication and pipeline 
analyses based on array data-flow analysis : Fig. 3 - Communication Edge 
Calculation; Sec. 3.4, 1st Par. - Once we identify the best granularity, we 
then calculate the specific communication placement and array data 
section to be communicated and form a CEP (Communication Edge 
Descriptor)); and 

• executing the D-pipeline stages in parallel within the D-stage processor 
pipeline to provide parallel execution of the sequential network application 
program (e.g.. Sec. 1 Introduction, 2"^ Par. - .. the complexity and 
sophistication of pipelined execution make automatic tools that can 
analyze sequential applications and derive pipelined implementations 
extremely desirable; ... we combined these analyses with behavioral 
synthesis tools to automatically synthesize application-specific pipelines 
onto a tarqet FPGA-based architecture ). 

Therefore, it would have been obvious to one of ordinary skill in the art, at the 
time the invention was made to combine the teachings of Ziegler into the 
Lakshmanamurthy's system to further provide the followings in 
Lakshmanamurthy system: 

• transforming a sequential network application program into D-pipeline 
stages that collectively perform the sequential packet processing state 
(PPS) of the sequential network application programs; and 
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• executing the D-pipeline stages in parallel within the D-stage processor 
pipeline to provide parallel execution of the sequential network application 
program. 

The motivation is that it would further enhance the Lakshmanamurthy's 

system by taking, advancing and/or incorporating Ziegler's system which offers 

significant advantages for a set of compiler analvses and an implementation that 

automaticallv map a sequential and un-annotated C program into a pipelined 

implementation targeted for an FPGA with multiple external memories as once 

« 

suggested by Ziegler (e.g., Abstract). 

5. As to claim 11 (Currently Amended), Lakshmanamurthy discloses an 
article of manufacture including a machine readable medium having stored 
thereon instructions which may be used to program a system to perform a 
method, comprising: 

• configuring one or more processors into a D-stage processor pipeline 
(e.g.. Sec. of "ABSTRACT", 1^* Par. - this paper describes the 
performance analysis methodology developed to analyze the performance 
of various networking applications that are targeted for running on the IXP 
2400 network processor , the second-generation IXA network processor : 
P. 19, L-Col., 3^^ Par., Lines 4-9 - this methodology involves diving the 
application into pipeline blocks ., and latency budget for each pipeline 
element, and mapping the application blocks to software paradigms and 
the hardware resources ). 
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Lakshmanamurthy discloses the performance analysis methodology 
developed to analyze the performance of various networking applications that are 
targeted for running on the IXP2400 network processor (e.g.. Abstract, 1^* Par.), 
but does not explicitly disclose the followings: 

• transforming a sequential network application program into D-pipeline 
stages that collectively perform the sequential packet processing stage 
(PPS) of the sequential network application program; and 

• executing the D-pipeline stages in parallel within the D-stage processor 
pipeline to provide parallel execution of the sequential network application 
program. 

However, in an analogous art of Compiler-Generated Communication for 
Pipelined FPGA Applications, Ziegler discloses the followings: 

• transforming a sequential network application program into D-pipeline 
stages that collectively perform the sequential packet processing stage 
(PPS) of the sequential network application program (e.g., Abstract, Lines 
1-4 - ... a set of compiler analyses and an implementation that 
automaticallv map a seouential and un-annotated C program into a 
pipelined implementation targeted for an FPGA ..: Lines 5-12 - ...from 
parallelizing compilers to identifv pipeline stages ...; using the results of 
this analysis, we automaticallv generate application-specific pipelined 
FPGA hardware designs : Fig. 1 - MVIS Kernel Analysis; Sec. 1 
Introduction, 2"^ Par., 2^^^ Bullet - ... an implementation of the analyses 
and code transformations required to automaticallv design and synthesize 
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pipelines tailored to sequential program characteristics : 2"^ Par, Lines 4-7 
- ... a set of compiler analyses to derive, from a sequential algorithm 
description, the components of pipelined design ., i.e., task parallelism and 
communication reguirements : Fig. 2 - DEFACTO (Design Environment for 
Adaptive Computing Technology) Compilation Flow, steps of 
'Communication and Pipeline Analysis', 'Code Transformations', 'SUIF to 
VHDL', 'Behavioral Synthesis and Estimation'; Sec. 3 Compiler Analysis, 
2"^ Par. - The DEFACTO system-level compiler takes an un-annotated, 
sequential program and applies a set of communication and pipeline 
analvses based on arrav data-flow analysis : Fig. 3 - Communication Edge 
Calculation; Sec. 3.4, 1st Par. - Once we identify the best granularity, we 
then calculate the specific communication placement and arrav data 
section to be communicated and form a CEP (Communication Edge 
Descriptor)); and 

• executing the D-pipeline stages in parallel within the D-stage processor 
pipeline to provide parallel execution of the sequential network application 
program (e.g.. Sec. 1 Introduction, 2"^ Par. - .. the complexity and 
sophistication of pipelined execution make automatic tools that can 
analvze seouential applications and derive pipelined implementations 
extremely desirable; ... we combined these analyses with behavioral 
synthesis tools to automaticallv synthesize application-specific pipelines 
onto a target FPGA-based architecture ). 
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Therefore, it would have been obvious to one of ordinary skill in the art. at the 
time the invention was made to combine the teachings of Ziegler into the 
Lakshmanamurthy's system to further provide the followings in 
Lakshmanamurthy system: 

• transforming a sequential network application program into D-pipeline 
stages that collectively perform the sequential packet processing stage 
(PPS) of the sequential network application program; and 

• executing the D-pipeline stages in parallel within the D-stage processor 
pipeline to provide parallel execution of the sequential network application 
program. 

The motivation is that it would further enhance the Lakshmanamurthy's 
system by taking, advancing and/or incorporating Ziegler's system which offers 
significant advantages for a set of compiler analyses and an implementation that 
automaticallv map a sequential and un-annotated C program into a pipelined 
implementation targeted for an FPGA with multiple external memories as once 
suggested by Ziegler (e.g., Abstract). 

6. As to claim 21 (Currently Amended), Lakshmanamurthy discloses a 
method comprising: 

• constructing a flow network model from a sequential network application 
program (e.g.. Sec. of "ABSTRACT", 1®* Par. - this paper describes the 
performance analysis methodology developed to analyze the performance of 
various networking applications that are targeted for running on the IXP 2400 
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network processor , the second-generation IXA network processor : Sec. 
Introduction, 2"^ Par, Lines 18-23 - in analyzing the performance of 
networking applications running on the IXP2400 network processor and 
presents a case study using the IPv4 forwarding + DiffServ application ; 3^^ 
Par. - ... a detailed data movement model of the target application . This 
model describes the various operations performed bv the network processor 
on every received packet) : 
• cutting the flow network model into a plurality of preliminary pipeline stages 
(e.g., P. 19, L-Col., 3"^ Par., Lines 4-9 - this methodology involves diving the 
application into pipeline blocks ., and latency budget for each pipeline 
element, and mapping the application blocks to software paradigms and the 
hardware resources ). 

Lakshmanamurthy discloses the performance analysis methodology 
developed to analyze the performance of various networking applications that are 
targeted for running on the IXP2400 network processor (e.g., Abstract, 1^* Par.), 
but does not explicitly disclose: 

• transforming the preliminary pipeline stages to perform control flow and 
variable transmission therebetween to form D-pipeline stages that 
collectively perform the sequential packet processing stage (PPS) of 
the sequential network application program to enable parallel 
execution of the sequential network application program. 
However, in an analogous art of Compiler-Generated Communication for 
Pipelined FPGA Applications, Ziegler discloses: 
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• transforming the preliminary pipeline stages to perfonrn cx)ntrol flow and 
variable transmission therebetween to form D-pipeline stages that 
collectively perform the sequential packet processing stage (PPS) of 
the sequential network application program to enable parallel 
execution of the sequential network application program (e.g.. 
Abstract, Lines 1-4 - ... a set of compiler analyses and an 
implementation that automatically map a sequential and un-annotated 
C program into a pipelined implementation targeted for an FPGA ..; 
Lines 5-12 - ... from parallelizing compilers to identifv pipeline stages 
. . .; using the results of this analysis, we automaticallv generate 
application-specific pipelined FPGA hardware designs : Fig. 1 - MVIS 
Kernel Analysis; Sec. 1 Introduction, 2""^ Par, 2""^ Bullet «... an 
implementation of the analyses and code transformations required to 
automaticallv design and synthesize pipelines tailored to seguential 
program characteristics : 2"^ Par., Lines 4-7 - ... a set of compiler 
analyses to derive, from a seguential algorithm description, the 
components of pipelined design ., i.e.. task parallelism and 
communication reguirements : Fig. 2 - DEFACTO (Design Environment 
for Adaptive Computing Technology) Compilation Flow, steps of 
'Communication and Pipeline Analysis', 'Code Transformations', *SUIF 
to VHDL*, 'Behavioral Synthesis and Estimation'; Sec. 3 Compiler 
Analysis, 2"^ Par. - The DEFACTO system-level compiler takes an un- 
annotated, seguential program and applies a set of communication and 
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pipeline analyses based on array data-flow analysis : Fig. 3 - 
Communication Edge Calculation; Sec. 3.4, 1st Par. - Once we identify 
the best granularity, we then calculate the specific communication 
placement and array data section to be communicated and form a 
CEP (Communication Edge Descriptor)). 
Therefore, it would have been obvious to one of ordinary skill in the art, at 
the time the invention was made to combine the teachings of Ziegler into the 
Lakshmanamurthy's system to further provide the following in Lakshmanamurthy 
system: 

• transforming the preliminary pipeline stages to perfonn control flow and 
variable transmission therebetween to form D-pipeline stages that 
collectively perform the sequential packet processing stage (PPS) of 
the sequential network application program to enable parallel 
execution of the sequential network application program. 
The motivation is that it would further enhance the Lakshmanamurthy's 
system by taking, advancing and/or incorporating Ziegler's system which offers 
significant advantages for a set of compiler analyses and an implementation that 
automatically map a sequential and un-annotated C program into a pipelined 
implementation targeted for an FPGA with multiple external memories as once 
suggested by Ziegler (e.g.. Abstract). 



7. As to claim 26 (Currently Amended), Lakshmanamurthy discloses an 
article of manufacture including a machine readable medium having stored 



Application/Control Number: Page 14 

• 10/714.465 
Art Unit: 2192 

thereon instructions which may be used to program a system to perform a 
method, comprising: 

• constructing a flow network model from a sequential network application 
program (e.g., Sec. of "ABSTRACT", 1^* Par. - this paper describes the 
performance analysis methodology developed to analyze the perfomiance of 
various networking applications that are targeted for running on the IXP 2400 
network processor the second-generation IXA network processor : Sec. 
Introduction, 2"^ Par., Lines 18-23 - in analyzing the performance of 
networking applications running on the IXP2400 network processor and 
presents a case study using the IPv4 forwarding + DIffServ application : 3"^^ 
Par. "... a detailed data movement model of the target application . This 
model describes the various operations performed bv the network processor 
on every received packet) ; 

• cutting the flow network model into a plurality of preliminary pipeline stages 
(e.g., P. 19, L-Col., 3^^ Par., Lines 4-9 - this methodology involves diving the 
application into pipeline blocks ., and latency budget for each pipeline 
element, and mapping the application blocks to software paradigms and the 
hardware resources ). 

Lakshmanamurthy discloses the performance analysis methodology 
developed to analyze the performance of various networking applications that are 
targeted for running on the IXP2400 network processor (e.g.. Abstract, 1®* Par), 

but does not explicitly disclose: 
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• transforming the preliminary pipeline stages to perfonm control flow and 
variable transmission therebetween in order to form D-pipeline stages 
that collectively perfonn the sequential packet processing stage (PPS) 
of the sequential network application program to enable parallel 
execution of the sequential network application program. 

However, in an analogous art of Compiler-Generated Communication for 
Pipelined FPGA Applications, Ziegler discloses: 

• transforming the preliminary pipeline stages to perform, control flow and 
variable transmission therebetween in order to form D-pipeline stages 
that collectively perform the sequential packet processing stage (PPS) 
of the sequential network application program to enable parallel 
execution of the sequential network application program (e.g., 
Abstract, Lines 1-4 - ... a set of compiler analyses and an 
implementation that automaticallv map a sequential and un-annotated 
C program into a pipelined implementation targeted for an FPGA ..; 
Lines 5-12 - ... from parallelizing compilers to identify pipeline stages 

using the results of this analysis, we automaticallv generate 
application-specific pipelined FPGA hardware designs ; Fig. 1 - MVIS 
Kernel Analysis: Sec. 1 Introduction. 2""* Par.. 2"^ Bullet - ... an 
implementation of the analyses and code transformations required to 
automaticallv design and synthesize pipelines tailored to sequential 
program characteristics ; 2"^ Par., Lines 4-7 - ... a set of compiler 
analyses to derive, from a seguential algorithm description, the 
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components of pipelined design ., i.e., task parallelism and 
communication requirements : Fig. 2 - DEFACTO (Design Environment 
for Adaptive Computing Technology) Compilation Flow, steps of 
'Communication and Pipeline Analysis', 'Code Transformations', 'SUIF 
to VHDL', 'Behavioral Synthesis and Estimation'; Sec. 3 Compiler 
Analysis, 2"^ Par. - The DEFACTO system-level compiler takes an un- 
annotated, sequential program and applies a set of communication and 
pipeline analvses based on arrav data-flow analysis : Fig. 3 - 
Communication Edge Calculation; Sec. 3.4, 1st Par. - Once we identify 
the best granularity, we then calculate the specific communication 
placement and arrav data section to be communicated and form a 
CED (Communication Edge Descriptor)). 
Therefore, it would have been obvious to one of ordinary skill in the art, at 
the time the invention was made to combine the teachings of Ziegler into the 
Lakshmanamurthy's system to further provide the following in Lakshmanamurthy 
system: 

• transforming the preliminary pipeline stages to perform control flow and 
variable transmission therebetween in order to form D-pipeline stages 
that collectively perform the sequential packet processing stage (PPS) 
of the sequential network application program to enable parallel 
execution of the sequential network application program. 
The motivation is that it would further enhance the Lakshmanamurthy's system 
by taking, advancing and/or incorporating Ziegler's system which offers 
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significant advantages for a set of compiler analyses and an implementation that 
automatically map a sequential and un-annotated C program into a pipelined 
implementation targeted for an FPGA with multiple external memories as once 
suggested by Ziegler (e.g., Abstract). 

8. As to claim 31 (Currently Amended), Lakshmanamurthy discloses an 
apparatus, comprising: 

• a processor (e.g., Sec. of "ABSTRACT", 1®^ Par. - this paper describes the 
performance analysis methodology developed to analyze the performance of 
various networking applications that are targeted for running on the IXP 2400 
network processor the second-generation IXA network processor ; Fig. 1 - 
IXP 2400 external interface, element of "IXP 2400"; P. 20, R-Col., 1'* Par.; P. 
21 , L-Col., 3'"* Par. - IXP 2400 contains eight multi-threaded, packet- 
processing micro-engines : these micro-engines are highly programmable 
packet processors and support multi threading of up to eight threads each; 
each micro-engine provides a variety of network processing functions in 
hardware; P. 21 , R-CoL, 2""^ Par. - the IXP 2400 also has an integrated low- 
power general-purpose Intel® Xscale™ micro-architecture core : the 
integrated Xscale™ process offers ample processing power for running 
control plane software); 

• a memory coupled to the processor (e.g., P. 20, L-Col., 4"^ Par. - extern 
DRAM and SRAM: P. 20, L-Col., 4^ Par. - P. 21, L-Col., 1'* Par. - the SRAM 
is primarily used for packet descriptors, queue descriptors, counters, and 
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other data structures; Fig. 2 - IXP 2400 internal architecture, elements of 
"QDR SRAM", "DDRAM"; Fig. 3 - IXP 2400-based OC-48 line card 
configuration, element of " DDR SDRAM "). 

Lakshmanamurthy discloses the performance analysis methodology 
developed to analyze the performance of various networking applications that are 
targeted for running on the IXP240Q network processor (e.g., Abstract, 1^* Par.), 
but does not explicitly disclose: 

• the memory including a compiler to cause transformation of a 
sequential network application program into D-pipeline stages that 
collectively perform the sequential packet processing stage (PPS) of 
the sequential network application program to enable parallel 
execution of the D-pipeline stages within a D-stage processor pipeline 
to provide parallel execution of the sequential network application 
program. 

However, in an analogous art of Compiler-Generated Communication for 
Pipelined FPGA Applications, Ziegler discloses: 

• the memory including a compiler to cause transformation of a . 
sequential network application program into D-pipeline stages that 
collectively perform the sequential packet processing stage (PPS) of 
the sequential network application program to enable parallel 
execution of the D-pipeline stages within a D-stage processor pipeline 
to provide parallel execution of the sequential network application 
program (e.g., Abstract, Lines 1-4 - ... a set of compiler analyses and 
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an implementation that automatically map a sequential and un- 
annotated C program into a pipelined implementation targeted for an 
FPGA ..; Lines 5-12 - ... from parallelizing compilers to identify pipeline 
stages using the results of this analysis, we automatically generate 
application-specific pipelined FPGA hardware designs : Fig. 1 - MVIS 
Kernel Analysis; Sec. 1 Introduction, 2""^ Par, 2""* Bullet «... an 
implementation of the analyses and code transformations required to 
automatically design and synthesize pipelines tailored to seguential 
program characteristics : 2"^ Par, Lines 4-7 - ... a set of compiler 
analyses to derive, from a seguential algorithm description, the 
components of pipelined design ., i.e., task parallelism and 
communication reguirements : Fig. 2 - DEFACTO (Design Environment 
for Adaptive Computing Technology) Compilation Flow, steps of 
'Communication and Pipeline Analysis', 'Code Transfonmations', 'SUIF 
to VHDL\ 'Behavioral Synthesis and Estimation'; Sec. 3 Compiler 
Analysis, 2"^ Par - The DEFACTO system-level compiler takes an un- 
annotated. seguential program and applies a set of communication and 
pipeline analyses based on array data-flow analysis : Fig. 3 - 
Communication Edge Calculation; Sec. 3.4, 1st Par - Once we identify 
the best granularity, we then calculate the specific communication 
placement and array data section to be communicated and form a 
CED (Communication Edge Descriptor)). 
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Therefore, it would have been obvious to one of ordinary skill in the art, at 
the time the invention was made to combine the teachings of Ziegler into the 
Lakshmanamurthy's system to further provide the following in Lakshmanamurthy 
system: 

• the memory including a compiler to cause transformation of a 
sequential network application program into D-pipeline stages that 
collectively perform the sequential packet processing stage (PPS) of 
the sequential network application program to enable parallel 
execution of the D-pipeline stages within a D-stage processor pipeline 
to provide parallel execution of the sequential network application 
program. 

The motivation is that it would further enhance the Lakshmanamurthy's 
system by taking, advancing and/or incorporating Ziegler's system which offers 
significant advantages for a set of compiler analvses and an implementation that 
automaticallv map a sequential and un-annotated C program into a pipelined 
implementation targeted for an FPGA with multiple external memories as once 
suggested by Ziegler (e.g.. Abstract). 

9. As to claimi 34 (Currently Amended), Lakshmanamurthy discloses a 
system comprising: 

• a processor (e.g., Sec. of "ABSTRACT", 1®* Par. - this paper describes the 
performance analysis methodology developed to analyze the performance of 
various networking applications that are targeted for running on the IXP 2400 
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network processor , the second-generation IXA network processor : Fig. 1 - 
IXP 2400 external interface, element of "IXP 2400"; P. 20. R-Col.. 1^* Par.; P. 
21. L-Col., 3"* Par. - IXP 2400 contains eight multi-threaded, packet- 
processing micro-engines : these micro-engines are highly programmable 
packet processors and support multi threading of up to eight threads each; 
each micro-engine provides a variety of network processing functions in 
hardware : P. 21 , R-CoL, 2"*^ Par. - the IXP 2400 also has an integrated low- 
power general-purpose Intel® Xscale^*^ micro-architecture core : the 
integrated Xscale™ process offers ample processing power for running 
control plane software); 

• a memory controller coupled to the processor (e.g.. P. 21 , L-CoL, 3^^ Par, 
Lines 13-14 - the memory controllers facilitate efficient access to the of-chip 
SRAM and DRAM); and 

• a DDR SRAM memory coupled to the processor (e.g., P. 20, L-CoL, 4"^ Par. - 
extern DRAM and SRAM : P. 20, L-Col.. 4^'' Par. P. 21 , L-CoL, 1^^ Par. - the 
SRAM is primarily used for packet descriptors, queue descriptors, counters, 
and other data structures; Fig. 2 - IXP 2400 internal architecture, element of 
"QDR SRAM"; Fig. 3 - IXP 2400-based OC-48 line card configuration, 
element of " DDR SDRAM "). 

Lakshmanamurthy discloses the performance analysis methodology 
developed to analyze the performance of various networking applications that are 
targeted for mnning on the IXP2400 network processor (e.g., Abstract, 1^* Par.), 
but does not explicitly disclose: 
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• the memory including a compiler to cause transformation of a 
sequential network application program into D-application program 
stages that collectively perfonn the sequential packet processing stage 
(PPS) of the sequential network application program to enable parallel 
execution of the D-application program stages within a D-stage 
processor pipeline to provide parallel execution of the sequential 
network application program. 

However, in an analogous art of Compiler-Generated Communication for 
Pipelined FPGA Applications, Ziegler discloses: 

• the memory including a compiler to cause transformation of a 
sequential network application program into D-application program 
stages that collectively perform the sequential packet processing stage 
(PPS) of the sequential network application program to enable parallel 
execution of the D-application program stages within a D-stage 
processor pipeline to provide parallel execution of the sequential 
network application program (e.g.. Abstract, Lines 1-4 - ... a set of 
compiler analyses and an implementation that automaticallv map a 
sequential and un-annotated C program into a pipelined 
implementation targeted for an FPGA ..; Lines 5-12 - from 
parallelizing compilers to identifv pipeline stages ...; using the results 
of this analysis, we automaticallv generate application-specific 
pipelined FPGA hardware designs : Fig. 1 - MVIS Kernel Analysis; 
Sec. 1 Introduction, 2"^ Par, 2"*^ Bullet - ... an implementation of the 
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analyses and code transformations required to automatically design 
and synthesize pipelines tailored to sequential program characteristics ; 
2"^ Par., Lines 4-7 - ... a set of compiler analyses to deriye, from a 
sequential algorithm description, the components of pipelined design ., 
i.e., task parallelism and communication reouirements : Fig. 2 - 
DEFACTO (Design Enyironment for Adaptive Computing Technology) 
Compilation Flow, steps of 'Communication and Pipeline. Analysis', 
'Code Transformations', *SUIF to VHDL\ 'Behavioral Synthesis and 
Estimation'; Sec. 3 Compiler Analysis, 2"^ Par. - The DEFACTO 
system-level compiler takes an un-annotated, seguential program and 
applies a set of communication and pipeline analyses based on array 
data-flow analysis : Fig. 3 - Communication Edge Calculation; Sec. 3.4, 
1st Par. - Once we identify the best granularity, we then calculate the 
specific communication placement and array data section to be 
communicated and form a CED (Communication Edge Descriptor)). 
Therefore, it would have been obvious to one of ordinary skill in the art, at 
the time the invention was made to combine the teachings of Ziegler into the 
Lakshmanamurthy's system to further provide the following in Lakshmanamurthy 
system: 

• the memory including a compiler to cause transformation of a sequential 
network application program into D-application program stages that 
collectively perform the sequential packet processing stage (PPS) of the 
sequential network application program to enable parallel execution of the 
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D-application program stages within a D-stage processor pipeline to 
provide parallel execution of the sequential network application program. 
The motivation is that it would further enhance the Lakshmanamurthy's system 
by taking, advancing and/or incorporating Ziegler's system which offers 
significant advantages for a set of compiler analvses and an implementation that 
automaticallv map a sequential and un-annotated C program into a pipelined 
implementation targeted for an FPGA with multiple external memories as once 
suggested by Ziegler (e.g., Abstract). 

10. Claims 2. 12. 32. and 35 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Lakshmanamurthy in view of Ziegler and further in view of 
Rakhmatov et aL, {''Hardware-Software Bipartitioning for Dynamically 
Reconfigurable Systems", May 2002, ACM) (hereinafter 'Rakhmatov') 

11. As to claim 2 (Original) (incorporating the rejection in claim 1 ), 
Lakshmanamurthy discloses network processor performance analysis 
methodology (e.g., Sec. of "ABSTRACT". 3'^ Par), but Lakshmanamurthy and 
Ziegler do not explicitly disclose transforming the sequential application program 
comprises constructing a flow network model for the sequential application 
program; selecting a plurality of preliminary pipeline stages from the flow network 
model; and modifying the preliminary pipeline stages to perform control flow and 
variable transmission therebetween to form the D-pipeline stages. 
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However, in an analogous art of hardware-software bi-partitioning for 
dynamically reconfigurable systems, Rakhmatov discloses transforming the 
sequential application program comprises constructing a flow network model for 
the sequential application program; selecting a plurality of preliminary pipeline 
stages from the flow network model; and modifying the preliminary pipeline 
stages to perform control flow and variable transmission therebetween to form 
the D-pipeline stages (e.g., Sec. of "ABSTRACT" - a method for mapping nodes 
of an application control flow graph either to software or reconfigurable hardware, 
explicitly targeting minimization of the energy-delay cost due to both computation 
and configuration; using network flow techniques, after transforming the original 
control flow graph into an equivalent network; P. 145, R-Col.. 1®* Par. through 3^^ 
Par. - the software can directly configure the hardware, which is partially 
reconfigurable; partial reconfiguration allows for a selective change of hardware 
segments of arbitrary size at an arbitrary location, without disrupting the 
operation of the rest of the hardware space; such a capability greatly reduces 
reconfiguration time and energy consumption, because the hardware updates 
are highly localized. Three types of problems: (1) energy-delay product 
minimization, (2) energy minimization under the delay constraint, and (3) delay 
minimization under the energy constraint can be resolved via using network flow 
techniques; specifically, the cost of a node depends whether it is in software or in 
hardware and the cost of an edge depends whether its origin node is in software 
or in hardware and whether its destination node is in software or in hardware; P. 
146, L-Col.. 2"^ Par. - 4*^ Par. - the cost/weight can be either the energy or the 
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delay or the energy-delay product of a node/edge, weighted by its execution 
frequency; first, transferring control from a hardware block to a software block is 
more expensive than transferring control from a software block to a software 
block; second, transferring control from a software block to a hardware block is 
more expensive than transferring control from a hardware block to a hardware 
block; P. 147, Sec. of "Constrained Bipartitioning Algorithms" - cost-driven 
constrained bi-partitioning; weight-driven constrained bi-partitioning; P. 148, R- 
CoL, 1^* Par.; Fig. 3 - proposed constrained bi-partitioning algorithms). 

Therefore, it would have been obvious to one of ordinary skill in the art, at 
the time the invention was made to combine the teachings of Rakhmatov into the 
Lakshmanamurthy-Ziegler's system to further provide transforming the sequential 
application program comprises constructing a flow network model for the 
sequential application program; selecting a plurality of preliminary pipeline stages 
from the flow network model; and modifying the preliminary pipeline stages to 
perform control flow and variable transmission therebetween to form the D- 
pipeline stages in Lakshmanamurthy-Ziegler system. 

The motivation is that it would further enhance the Lakshmanamurthy- 
Ziegler's system by taking, advancing and/or incorporating Rakhmatov's system 
which offers significant advantages for providing an efficient bi-partitioning 
algorithm that finds an optimal solution for energy-delay product minimization and 
systematically searches for the best in polynomially bounded set of good 
solutions for delay-constrained energy minimization, and energy-constrained 
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delay minimization, the formulation also including costs and weights as design 
parameters as once suggested by Rakhmatov (e.g., Sec. of "CONCLUSION"). 



12. As to claim 12 (Currently Amended) (incorporating the rejection in claim 
1 1 ). please refer to claim 2 as set forth accordingly. 

13. As to claim 32 (Original) (incorporating the rejection in claim 31). 
Lakshmanamurthy discloses network processor performance analysis 
methodology (e.g.. Sec. of "ABSTRACT", 3'*^ Par.), but Lakshmanamurthy and 
Ziegler do not explicitly disclose the compiler to cause construction of a flow 
network model for the sequential application program, to cause selection of a 
plurality of preliminary pipeline stages from the flow network model and to cause 
modification of the preliminary pipeline stages to perform control flow and 
variable transformation therebetween to form the D-pipeline stages. 

However, in an analogous art of hardware-software bi-partitioning for 
dynamically reconfigurable systems, Rakhmatov discloses the compiler to cause 
construction of a flow network model for the sequential application program, to 
cause selection of a plurality of preliminary pipeline stages from the flow network 
model and to cause modification of the preliminary pipeline stages to perform 
control flow and variable transformation therebetween to form the D-pipeline 
stages (e.g., Sec. of "ABSTRACT" - a method for mapping nodes of an 
application control flow graph either to software or reconfigurable hardware, 
explicitly targeting minimization of the energy-delay cost due to both computation 
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and configuration; using network flow techniques, after transforming the original 
control flow graph into an equivalent network; P. 145, R-CoL, 1^' Par. through 3*^^ 
Par. - the software can direcfly configure the hardware, which is partially 
reconfigurable; partial reconfiguration allows for a selective change of hardware 
segments of arbitrary size at an arbitrary location, without disrupfing the 
operation of the rest of the hardware space; such a capability greatly reduces 
reconfiguration time and energy consumption, because the hardware updates 
are highly localized. Three types of problems: (1 ) energy-delay product 
minimization, (2) energy minimization under the delay constraint, and (3) delay 
minimization under the energy constraint can be resolved via using network flow 
techniques; specifically, the cost of a node depends whether it is in software or in 
hardware and the cost of an edge depends whether its origin node is in software 
or in hardware and whether its destination node is in software or in hardware; P. 
146, L-CoL, 2"^ Par. - 4*^ Par - the cost/weight can be either the energy or the 
delay or the energy-delay product of a node/edge, weighted by its execution 
frequency; first, transferring control from a hardware block to a software block is 
more expensive than transferring control from a software block to a software 
block; second, transferring control from a software block to a hardware block is 
more expensive than transferring control from a hardware block to a hardware 
block; P. 147, Sec. of "Constrained Bipartitioning Algorithms" - cost-driven 
constrained bi-partitioning; weight-driven constrained bi-partitioning; P. 148, R- 
Col., 1®* Par; Fig. 3 - proposed constrained bi-partitioning algorithms). 
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Therefore, it would have been obvious to one of ordinary skill in the art. at 
the time the invention was made to combine the teachings of Rakhmatov into the 
Lakshmanamurthy-Ziegler's system to further provide the compiler to cause 
construction of a flow network model for the sequential application program, to 
cause selection of a plurality of preliminary pipeline stages from the flow network 
model and to cause modification of the preliminary pipeline stages to perform 
control flow and variable transformation therebetween to form the D-pipeline 
stages in Lakshmanamurthy-Ziegler system. 

The motivation is that it would further enhance the Lakshmanamurthy- 
Ziegler's system by taking, advancing and/or incorporating Rakhmatov's system 
which offers significant advantages for providing an efficient bi-partitioning 
algorithm that finds an optimal solution for energy-delay product minimization and 
systematically searches for the best in polynomially bounded set of good 
solutions for delay-constrained energy minimization, and energy-constrained 
delay minimization, the formulation also including costs and weights as design 
parameters as once suggested by Rakhmatov (e.g., Sec. of "CONCLUSION"). 

14. As to claim 35 (Original) (incorporating the rejection in claim 34), please 
refer to claim 32 as set forth accordingly. 

15, Claims 3-4, 8, 10, 13-14, 18, 20. 33, and 36 are rejected under 35 U.S.C. 
103(a) as being unpatentable over Lakshmanamurthy in view of Ziegler, 
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Rakhmatov and further in view of Robschink et al., {"Efficient Patli Conditions in 
Dependence Graplis", May 2002, ACM) (hereinafter ^Robschink^) 



16. As to claim 3 (Original) (incorporating the rejection in claim 2), 
Rakhmatov discloses constructing the flow network model (e.g.. Sec. of 
"ABSTRACT" - a method for mapping nodes of an application control flow graph 
either to software or reconfigurable hardware, explicitly targeting minimization of 
energy-delay cost due to both computation and configuration; show how these 
problems can be tackled by using network flow techniques, after transforming the 
original control flow graph into an equivalent network), but Lakshmanamurthy, 
Ziegler and Rakhmatov do not explicitly disclose transforming the application 
program into a static, single-assignment form; building a control flow graph for a 
loop body of the application program; building a dependence graph based on a 
summary graph of the control flow graph and identified, strongly-connected 
components (SSC) of the control flow graph; and constructing the flow network 
model according to a summary graph of the dependence graph and identified 
SSC nodes of the dependence graph. 

However, in an analogous art of efficient path conditions in dependence 
graphs, Robschink discloses transforming the application program into a static, 
single-assignment form (e.g., Sec. 2.2 - Path conditions, 2"^ Pa. - since there 
may be assignments to the same variable at different program points, all 
programs must be transformed into static single assignment form (SSA) first. In 
SSA from, there is at most one assignment to every variable. If necessary, we 
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will distinguish different SSA-variants of a program variable by additional indices; 
P. 480, 3'^ Par - since the program is transformed to SSA form first, some 
additional constraints must be generated which represent the ^-function 
occurring in SSA form); building a control flow graph for a loop body of the 
application program; building a dependence graph based on a summary graph of 
the control flow graph (e.g., Sec. 1 - Introduction, 4^ Par. - ValSoft can build a 
dependence graph for 50000 lines of C; fonvard and backward slices or chops 
can be interactively computed and visualized in the source text; Fig. 1 - a 
mergesort program and part of its SDG (System Dependence Graph); Sec. 2.1 - 
Dependence and slices, 2"^ Par. - slices can be defined via the system 
dependence graph (SDG); Sec. 3 - Basic Analysis, 1^^ Par; Sec. 3.1 - Analyzing 
data flow, 1®* Par.) and identified, strongly-connected components (SSC) of the 
control flow graph; and constructing the flow network model according to a 
summary graph of the dependence graph and identified SSC nodes of the 
dependence graph (e.g., Sec. 4.2 - Exploiting interval analysis, 2"^ Par through 
3^ Par.). 

Therefore, it would have been obvious to one of ordinary skill in the art, at 
the time the invention was made to combine the teachings of Robschink into the 
Lakshmanamurthy-Ziegler-Rakhmatov's system to further provide transforming 
the application program into a static, single-assignment form; building a control 
flow graph for a loop body of the application program; building a dependence 
graph based on a summary graph of the control flow graph and identified, 
strongly-connected components (SSC) of the control flow graph; and 
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constructing the flow network model according to a summary graph of the 
dependence graph and identified SSC nodes of the dependence graph in 
Lakshmanamurthy-Ziegler-Rakhmatov system. 

The motivation is that it would further enhance the Lakshmanamurthy- 
Ziegler-Rakhmatov's system by taking, advancing and/or incorporating 
Robschink's system which offers that path conditions in dependence graphs are 
a valuable tool for various kinds of program analysis, such as program 
understanding or safety checks as once suggested by Robschink (e.g., Sec. of 
"CONCLUSION AND FUTURE WORK". 1®* Par.). 

17. As to claim 4 (Original) (incorporating the rejection in claim 3), 
Rakhmatov discloses constructing the flow network model comprises assigning a 
unique source node and a unique sink node to the flow network model (e.g., Sec. 
4 - Proposed Solution. 1®* Par.). 

Robschink discloses adding a program node to the flow network model for 
each SSC node identified in the summary graph of the dependence graph (e.g., 
Sec. 2.2 - Path conditions, 2^^ Pa. - since there may be assignments to the 
same variable at different program points, all programs must be transformed into 
static single assignment form (SSA) first. In SSA from, there is at most one 
assignment to every variable. If necessary, we will distinguish different SSA- 
variants of a program variable by additional indices; P. 480, 3"^ Par. - since the 
program is transformed to SSA form first, some additional constraints must be 
generated which represent the ^-function occurring in SSA form); adding a 
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variable node to the flow network model for each variable that is defined and 
used by multiple program nodes; adding a control node C to the flow network 
model for each SSC node identified in the summary graph of the dependence 
graph as a source of control dependence (e.g., Sec. 1 - Introduction. 4^ Par. - 
ValSoft can build a dependence graph for 50000 lines of C; fon^^ard and 
backward slices or chops can be interactively computed and visualized in the 
source text; Fig. 1 - a mergesort program and part of its SDG (System 
Dependence Graph); Sec. 2.1 - Dependence and slices. 2"^ Par. - slices can be 
defined via the system dependence graph (SDG); Sec. 3 - Basic Analysis, 1®* 
Par.; Sec. 3.1 - Analyzing data flow. 1^* Par.). 

Further, Rakhmatov discloses generating edges having an associated 
weight to connect corresponding program nodes to corresponding variable 
nodes; generating edges having an associated weight to connect corresponding 
program nodes to corresponding control nodes; and generating edges between 
the program nodes and one of the source node and the sink node (e.g., Sec. 2 - 
Problem Description, 1®* Par. through 4**^ Par., and 6*^ Par.; P. 147, L-Col.,1^* 
Par.; Sec. 4 - Proposed Solution, 1^* Par. through 2"^ Par.; P. 147. R-CoL, 1^* 
Par. through 4*^ Par). 

18. As to claim 8 (Original) (incorporating the rejection in claim 2). 
Rakhmatov discloses selecting the plurality of preliminary pipeline stages 
comprises cutting the flow network model into D-1 successive cuts, such that 
each cut is a balanced minimum cost cut (e.g.. Sec. 1 - Introduction, 4*^ Par. - 
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cost function involve costs of all nodes and all edges in the CFG, and not just the 
edges in the cut-set separating software-mapped nodes and hardware-mapped 
nodes; specifically, the cost of a node depends whether it is in software or in 
hardware, and the cost of an edge depends whether its origin node is in software 
or in hardware and whether its destination node is in software or in hardware; 
Sec. 3 - Related Work. 2"^ Par. (Circuit Partitioning) Through 3'^ Par., 6^ Par 
(Our Contribution) - our contribution is to show how a CFG (Control Flow Graph) 
with node and edge costs can be transformed into a network, so that a minimum 
cut in the network corresponds to an optimal bi-partition of the CFG; P. 147, 1®* 
Par. through 2"^ Par.; Fig. 2 - unconstrained bi-partitioning algorithm - CUT = 
FindMinCut(V,E)Y 

19. As to claim 10 (Currently Amended) (incorporating the rejection in claim 
2), Rakhmatov discloses modifying the preliminary pipeline stages comprises (a) 
selecting a preliminary pipeline stage; (b) altering the selected preliminary 
pipeline stage to enable proper transmission of live variables; and control flow to 
and from the selected preliminary pipeline stage; and (c) (a) - (b) for each 
preliminary pipeline stage to form the D-pipeline stages of a parallel network 
application (e.g., Sec. of "ABSTRACT" - a method for mapping nodes of an 
application control flow graph either to software or reconfigurable hardware, 
explicitly targeting minimization of the energy-delay cost due to both computation 
and configuration; using network flow techniques, after transforming the original 
control flow graph into an equivalent network; P. 145, R-CoL, 1^^ Par. Through 
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Par. - the software can directly configure the hardware, which is partially 
reconfigurable; partial reconfiguration allows for a selective change of hardware 
segments of arbitrary size at an arbitrary location, without disrupting the 
operation of the rest of the hardware space; such a capability greatly reduces 
reconfiguration time and energy consumption, because the hardware updates 
are highly localized. Three types of problems: (1) energy-delay product 
minimization, (2) energy minimization under the delay constraint, and (3) delay 
minimization under the energy constraint can be resolved via using network flow 
techniques; specifically, the cost of a node depends whether it is in software or in 
hardware and the cost of an edge depends whether its origin node is in software 
or in hardware and whether its destination node is in software or in hardware; P. 
146, L-Col., 2"^ Par. - 4^^ Par. - the cost/weight can be either the energy or the 
delay or the energy-delay product of a node/edge, weighted by its execution 
frequency; first, transferring control from a hardware, block to a software block is 
more expensive than transferring control from a software block to a software 
block; second, transfen^ing control from a software block to a hardware block is 
more expensive than transferring control from a hardware block to a hardware 
block; P. 147, Sec. of "Constrained Bipartitioning Algorithms" - cost-driven 
constrained bi-partitioning; weight-driven constrained bi-partitioning; P. 148, R- 
Col., 1^^ Par.; Fig. 3 - proposed constrained bi-partitioning algorithms). 



20. As to claim 13 (Original) (incorporating the rejection in claim 12), please 
refer to claim 3 as set forth accordingly. 
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21 . As to claim 14 (Original) (incorporating the rejection in claim 13), please 
refer to claim 4 as set forth accordingly. 

22. As to claim 18 (Original) (incorporating the rejection in claim 12), please 
refer to claim 8 as set forth accordingly. 

23. As to claim 20 (Original) (incorporating the rejection in claim 12), please 
refer to claim 10 as set forth accordingly. 

24. As to claim 33 (Original) (incorporating the rejection in claim 32), 
Robschink discloses the compiler to cause D-1 successive cuts of the flow 
network mode, such that each cut is a balanced, minimum cost cut to form the D- 
preliminary pipeline stages (e.g.. Sec. 1 - Introduction, 4**^ Par. - cost function 
involve costs of all nodes and all edges in the CFG, and not just the edges in the 
cut-set separating software-mapped nodes and hardware-mapped nodes; 
specifically, the cost of a node depends whether it is in software or in hardware, 
and the cost of an edge depends whether its origin node is in software or in 
hardware and whether its destination node is in software or in hardware: Sec. 3 - 
Related Work. 2""^ Par. (Circuit Partitioning) Through 3'"^ Par., 6*^ Par. (Our 
Contribution) - our contribution is to show how a CFG (Control Flow Graph) with 
node and edge costs can be transformed into a network, so that a minimum cut 
in the network corresponds to an optimal bi-partition of the CFG; P. 147, 1®* Par. 
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through 2"'* Par.; Fig. 2 - unconstrained bi-partitioning algorithm - CUT = 
FindMinCut(V,E)). 

25. As to claim 36 (Original) (incorporating the rejection in claini 35), please 
refer to claim 33 as set forth accordingly. 

26. Claims 9 and 19 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Lakshmanamurthy in view of Ziegler, Rakhmatov and 
Robschink and further in view of Goldberg et al., {"A New Approach to the 
Maximum-Flow Problem", 1988, ACM) (hereinafter 'Goldberg') 

27. As to claim 9 (Original) (incorporating the rejection in claim 8), 
Lakshmanamurthy, Ziegler, Rakhmatov, and Robschink do not disclose that 
cutting is performed using an iterative balanced to push-relabel algorithm. 

However, in an analogous art of a new approach to the maximum-flow 
problem, Goldberg discloses cutting is performed using an iterative balanced to 
push-relabel algorithm (e.g., P. 922, 4**' Par. - the algorithm pushes flow through 
the network to find a blocking flow, which determines the acyclic network for the 
next phase; our algorithm maintains a pre-flow in the original network and pushes 
local flow excess toward the sink along what it estimates to be shortest paths in 
the residual graph; P. 924, 4"^ Par. - the pre-flow algorithm works by examining 
vertices other than s and t with positive flow excess and pushing excess fomi 
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them to vertices estimated to be closer to the sink f, with the goal of getting as 
much excess as possible to t.). 

Therefore, it would have been obvious to one of ordinary skill in the art, at 
the time the invention was made to combine the teachings of Goldberg into the 
Lakshmanamurthy-Ziegler-Rakhmatov-Robschink's system to further provide that 
cutting is performed using an iterative balanced to push-relabel algorithm in 
Lakshmanamurthy-Ziegler-Rakhmatov-Robschink system. 

The motivation is that it would further enhance the Lakshmanamurthy- 
Rakhmatov-Ziegler-Robschink's system by taking, advancing and/or 
incorporating Goldberg's system which offers significant advantages that the 
method maintains a pre-flow in the original network and pushes local flow excess 
toward the sink along what are estimated to be shortest paths as once suggested 
by Goldberg (e.g., P. 921, 1'^ Par.). 

28. As to claim 19 (Original) (incorporating the rejection in claim 18), please 
refer to claim 9 as set forth accordingly. 

Allowable Subject Matter 

29. Claims 5-7, 15-17. 22-25 and 27-30 are objected to as being dependent 
upon a rejected base claim, but would be allowable if rewritten to overcome the 
rejections under 35 U.S.C. 103(a) set forth in this office action and to include all 
the limitations of the base claim and any intervening claims. 

The following is an examiner's statement of reasons for allowance: 
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Regarding claims 5-7, 15-17, 22-25, and 27-30. prior art of record fails to 
reasonably show or suggest the specific edge generations having associated 
weights, transformation of the preliminary application program stage, and 
transformation of the control flow as claimed. Specifically, the methods to 
generate edges having an associated weight to connect corresponding program 
nodes to (1) corresponding variable nodes, (2) corresponding controls nodes; the 
method to generate the edges between program nodes and one of the source 
node and the sink nodes; transformation of the preliminary application program 
stages; and transformation of the control flow in details. 

Conclusion 

30. Any inquiry concerning this communication or earlier communications from 
the examiner should be directed to Ben C. Wang whose telephone number is 
571-270-1240. The examiner can normally be reached on Monday - Friday, 8:00 
a.m. - 5:00 p.m., EST. 

If attempts to reach the examiner by telephone are unsuccessful, the 
examiner's supervisor, Tuan Q. Dam can be reached on 571-272-3695. The fax 
phone number for the organization where this application or proceeding is 
assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from 
the Patent Application Information Retrieval (PAIR) system. Status information 
for published applications may be obtained from either Private PAIR or Public 
PAIR. Status information for unpublished applications is available through 
Private PAIR only. For more information about the PAIR system, see http://pair- 
direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EEC) at 866-217-9197 (toll- 
free). If you would like assistance from a USPTO Customer Service 
Representative or access to the automated information system, call 800-786- 



9199 (IN USA OR CANADA) or 571-272-1000. 




BCW ^ 
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